
    Investigating Rumor News Using Agreement-Aware Search

    Recent years have witnessed a widespread increase in rumor news generated by both humans and machines. Tools for investigating rumor news have therefore become an urgent necessity. One useful function of such tools is to show how a specific topic or event is represented, by presenting different points of view from multiple sources. In this paper, we propose Maester, a novel agreement-aware search framework for investigating rumor news. Given an investigative question, Maester retrieves articles related to that question and assigns and displays the top articles in the agree, disagree, and discuss categories. Splitting the results into these three categories gives the user a holistic view of the investigative question. We build Maester on two key observations: (1) relatedness can commonly be determined by keywords and entities occurring in both questions and articles, and (2) the level of agreement between the investigative question and a related news article can often be decided by a few key sentences. Accordingly, we use gradient boosting tree models with keyword/entity matching features for relatedness detection, and a recurrent neural network to infer the level of agreement. Our experiments on the Fake News Challenge (FNC) dataset demonstrate up to an order-of-magnitude improvement of Maester over the original FNC winning solution for agreement-aware search.
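
    As a rough illustration of the two-stage design described in the abstract, the sketch below scores question-article relatedness with a gradient boosted tree over simple keyword-overlap features; the agreement-level RNN is only indicated in a comment. The feature set, the toy question/article pairs, and the use of scikit-learn's GradientBoostingClassifier are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch (not the authors' code): stage 1 scores question-article
# relatedness with a gradient boosted tree over keyword-overlap features;
# stage 2, the agree/disagree/discuss RNN, is only indicated in a comment.
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier

def overlap_features(question, article):
    """Toy keyword-matching features; a real system would add entity matches, TF-IDF, etc."""
    q, a = set(question.lower().split()), set(article.lower().split())
    shared = q & a
    return [len(shared),
            len(shared) / max(len(q), 1),   # fraction of question words covered
            len(shared) / max(len(a), 1)]   # fraction of article words covered

# Hypothetical labelled question/article pairs: 1 = related, 0 = unrelated.
pairs = [
    ("did company X recall product Y", "Company X announced a recall of product Y", 1),
    ("did company X recall product Y", "Local weather forecast for the weekend", 0),
    ("is drug Z approved", "Regulators approved drug Z on Monday", 1),
    ("is drug Z approved", "Football scores from last night", 0),
]
X = np.array([overlap_features(q, a) for q, a, _ in pairs])
y = np.array([label for _, _, label in pairs])
relatedness_model = GradientBoostingClassifier().fit(X, y)

query, doc = "did company X recall product Y", "Company X issues recall for product Y"
print(relatedness_model.predict_proba([overlap_features(query, doc)])[0, 1])
# Articles above a relatedness threshold would then be passed to a recurrent
# model that labels them agree / disagree / discuss relative to the question.
```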

    Holistic corpus-based dialectology

    This paper is concerned with sketching future directions for corpus-based dialectology. We advocate a holistic approach to the study of geographically conditioned linguistic variability, and we present a suitable methodology, 'corpus-based dialectometry', in exactly this spirit. Specifically, we argue that in order to live up to the potential of the corpus-based method, practitioners need to (i) abandon their exclusive focus on individual linguistic features in favor of the study of feature aggregates, (ii) draw on computationally advanced multivariate analysis techniques (such as multidimensional scaling, cluster analysis, and principal component analysis), and (iii) aid the interpretation of empirical results by marshalling state-of-the-art data visualization techniques. To exemplify this line of analysis, we present a case study which explores the joint frequency variability of 57 morphosyntactic features in 34 dialects across Great Britain.
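
    The aggregate-then-reduce workflow the abstract advocates can be sketched in a few lines: build a dialect-by-feature frequency matrix, standardize it, and feed it to dimension reduction and cluster analysis. The dialect names, feature counts, and frequencies below are invented for illustration and do not come from the paper's 34-dialect dataset.

```python
# A minimal sketch of corpus-based dialectometry on simulated data: rows are
# dialects, columns are per-feature frequencies; all values are invented.
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(0)
dialects = ["Dialect_A", "Dialect_B", "Dialect_C", "Dialect_D"]
# Frequencies (say, per 10,000 words) of a handful of morphosyntactic features.
freqs = rng.poisson(lam=[20, 5, 12, 30, 8], size=(len(dialects), 5)).astype(float)

z = StandardScaler().fit_transform(freqs)        # aggregate profile per dialect
coords = PCA(n_components=2).fit_transform(z)    # low-dimensional map for plotting
tree = linkage(z, method="ward")                 # hierarchical cluster analysis
groups = fcluster(tree, t=2, criterion="maxclust")

for name, xy, g in zip(dialects, coords, groups):
    print(f"{name}: PC1={xy[0]:+.2f}, PC2={xy[1]:+.2f}, cluster={g}")
```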

    Silicon Nanowire Sensors Enable Diagnosis of Patients via Exhaled Breath

    Two of the biggest challenges in medicine today are the need to detect diseases in a noninvasive manner and to differentiate between patients using a single diagnostic tool. The current study targets these two challenges by developing a molecularly modified silicon nanowire field effect transistor (SiNW FET) and showing its use in the detection and classification of several disease breathprints (lung cancer, gastric cancer, asthma, and chronic obstructive pulmonary disease). The fabricated SiNW FETs are characterized and optimized based on a training set that correlates their sensitivity and selectivity toward volatile organic compounds (VOCs) linked with the various disease breathprints. The best sensors obtained from the training set are then examined under real-world clinical conditions, using breath samples from 374 subjects. Analysis of the clinical samples shows that the optimized SiNW FETs can detect and discriminate between almost all binary comparisons of the diseases under examination with >80% accuracy. Overall, this approach has the potential to support the detection of many diseases in a direct, harmless way, which can reassure patients and prevent numerous unpleasant investigations.
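
    The pairwise ("binary comparison") evaluation mentioned in the abstract has a simple general shape, sketched below with simulated data: for each pair of conditions, train a classifier on per-subject sensor features and report cross-validated accuracy. The feature counts, group sizes, and choice of logistic regression are assumptions for illustration; the paper's actual signal processing and statistical analysis are not reproduced here.

```python
# Illustrative sketch only: pairwise binary discrimination of conditions from
# per-subject sensor features, scored by cross-validated accuracy. All data simulated.
from itertools import combinations
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(1)
conditions = ["lung_cancer", "gastric_cancer", "asthma", "copd"]
n_per_group, n_features = 30, 6   # hypothetical subjects per group and sensor features

# Simulated feature matrix: each condition gets a slightly shifted response profile.
data = {c: rng.normal(loc=i * 0.5, scale=1.0, size=(n_per_group, n_features))
        for i, c in enumerate(conditions)}

for a, b in combinations(conditions, 2):
    X = np.vstack([data[a], data[b]])
    y = np.array([0] * n_per_group + [1] * n_per_group)
    acc = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=5).mean()
    print(f"{a} vs {b}: CV accuracy = {acc:.2f}")
```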

    Feature extraction and selection for Arabic tweets authorship authentication

    In tweet authentication, we are concerned with correctly attributing a tweet to its true author based on its textual content. The more general problem of authenticating long documents has been studied before, and the most common approach relies on the intuitive idea that each author has a unique style that can be captured using stylometric features (SF). Inspired by the success of modern automatic document classification, some researchers have followed the Bag-Of-Words (BOW) approach for authenticating long documents. In this work, we consider both approaches and their application to authenticating tweets, which poses additional challenges due to the limited size of tweets. We focus on the Arabic language because of its importance and the scarcity of related work on it. We create different sets of features from both approaches and compare the performance of different classifiers using them. We experiment with various feature selection techniques in order to extract the most discriminating features. To the best of our knowledge, this is the first study of its kind to combine these different sets of features for authorship analysis of Arabic tweets. The results show that combining all the feature sets we compute yields the best results.
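
    A minimal sketch of the general recipe described above, under assumed choices: concatenate a few toy stylometric features with bag-of-words counts, apply chi-squared feature selection, and train a linear classifier. The miniature English corpus, the specific features, and the scikit-learn components are stand-ins, not the paper's Arabic data or pipeline.

```python
# Hedged sketch: combine stylometric (SF) and bag-of-words (BOW) features,
# select the most discriminating ones, and train a classifier. Toy data only.
import numpy as np
from scipy.sparse import hstack, csr_matrix
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.feature_selection import SelectKBest, chi2
from sklearn.svm import LinearSVC

# Hypothetical tiny corpus; real experiments would use Arabic tweets and many authors.
tweets  = ["good morning friends", "great game tonight", "reading a new book", "this book is great"]
authors = ["author1", "author1", "author2", "author2"]

def stylometric(text):
    """A few toy stylometric features; real SF sets are far richer."""
    words = text.split()
    return [len(text), len(words), sum(len(w) for w in words) / max(len(words), 1)]

bow = CountVectorizer()                       # bag-of-words term counts
X_bow = bow.fit_transform(tweets)
X_sf = csr_matrix(np.array([stylometric(t) for t in tweets]))
X = hstack([X_bow, X_sf])                     # combined SF + BOW feature space

selector = SelectKBest(chi2, k=5)             # keep the most discriminating features
X_sel = selector.fit_transform(X, authors)
clf = LinearSVC().fit(X_sel, authors)

new_tweet = "a great new book"
X_new = hstack([bow.transform([new_tweet]), csr_matrix([stylometric(new_tweet)])])
print(clf.predict(selector.transform(X_new)))
```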

    Discrimination in lexical decision.

    In this study we present a novel set of discrimination-based indicators of language processing derived from Naive Discriminative Learning (NDL) theory. We compare the effectiveness of these new measures with classical lexical-distributional measures (in particular, frequency counts and form-similarity measures) to predict lexical decision latencies when a complete morphological segmentation of masked primes is or is not possible. The data derive from a re-analysis of a large subset of decision latencies from the English Lexicon Project, as well as from the results of two new masked priming studies. The results demonstrate the superiority of discrimination-based predictors over lexical-distributional predictors alone, across both the simple and primed lexical decision tasks. Comparable priming after masked corner-type and cornea-type primes, across two experiments, fails to support early obligatory segmentation into morphemes as predicted by the morpho-orthographic account of reading. The results fit well with NDL theory, which, in conformity with Word and Paradigm theory, rejects the morpheme as a relevant unit of analysis. Furthermore, the results indicate that readers with greater spelling proficiency and larger vocabularies make better use of orthographic priors and handle lexical competition more efficiently.
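
    Naive discriminative learning rests on incremental Rescorla-Wagner updating of cue-outcome association weights; the sketch below implements that update for letter-bigraph cues and word-meaning outcomes. The tiny corner/cornea/corn lexicon and the learning parameters are illustrative assumptions, not the study's materials or exact model.

```python
# Sketch of the Rescorla-Wagner update underlying naive discriminative learning:
# cues are letter bigraphs, outcomes are word meanings; all materials are toy examples.
from collections import defaultdict

def bigraphs(word):
    w = f"#{word}#"                                  # word-boundary markers, as in NDL practice
    return [w[i:i + 2] for i in range(len(w) - 1)]

weights = defaultdict(float)                         # (cue, outcome) -> association strength
alpha_beta, lam = 0.01, 1.0                          # learning rate and maximum associability

events = [("corner", "CORNER"), ("cornea", "CORNEA"), ("corn", "CORN")] * 200
outcomes = {o for _, o in events}

for word, outcome in events:
    cues = bigraphs(word)
    for o in outcomes:
        total = sum(weights[(c, o)] for c in cues)   # current summed activation for outcome o
        target = lam if o == outcome else 0.0
        for c in cues:
            weights[(c, o)] += alpha_beta * (target - total)

def activation(word, outcome):
    """Summed cue-to-outcome support, a discrimination-based predictor of processing ease."""
    return sum(weights[(c, outcome)] for c in bigraphs(word))

for probe in ["corner", "cornea", "corn"]:
    print(probe, {o: round(activation(probe, o), 2) for o in sorted(outcomes)})
```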